Sub-optimal tracking in switched systems with fixed final time and fixed mode sequence using reinforcement learning

نویسندگان

چکیده

Approximate dynamic programming is used to solve optimal tracking problems in switched systems with controlled subsystems and fixed mode sequence. Two feedback control solutions are generated such that the system tracks a desired reference signal, switching instants sought. Simulation results provided illustrate effectiveness of solutions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fixed-final-time optimal tracking control of input-affine nonlinear systems

In this study, approximate dynamics programming framework is utilized for solving the Bellman equation related to the fixed-final-time optimal tracking problem of input-affine nonlinear systems. Convergence of the weights of the neurocontroller in the proposed successive approximation based algorithms is provided and the network is trained to provide the optimal solution to the problems with a)...

متن کامل

Fixed vs Dynamic Sub-transfer in Reinforcement Learning Technical report

We survey various transfer methods in Q-learning, a type of reinforcement learning, and present a variation on fixed sub-transfer which we call dynamic sub-transfer. We describe the pros and cons of dynamic sub-transfer as compared with the other transfer methods, and we describe qualitatively the situations where this method would be preferred over the fixed version of sub-transfer.

متن کامل

Fixed vs. Dynamic Sub-Transfer in Reinforcement Learning

We survey various task transfer methods in Qlearning and present a variation on fixed sub-transfer which we call dynamic sub-transfer. We discuss the benefits and drawbacks of dynamic sub-transfer as compared with the other transfer methods, and we describe qualitatively the situations where this method would be preferred over the fixed version of sub-transfer. We test this method against sever...

متن کامل

Fixed-final-time optimal control of nonlinear systems with terminal constraints

A model-based reinforcement learning algorithm is developed in this paper for fixed-final-time optimal control of nonlinear systems with soft and hard terminal constraints. Convergence of the algorithm, for linear in the weights neural networks, is proved through a novel idea by showing that the training algorithm is a contraction mapping. Once trained, the developed neurocontroller is capable ...

متن کامل

Optimal Sliding-Mode Guidance Law for Fixed-Interval Propulsive Maneuvers

An optimal strategy based on minimum effort control and also with terminal positionconstraint is developed for an exoatmospheric interceptor with a fixed- interval guidance time. It isthen integrated with sliding-mode control theory to drive an optimal sliding-mode guidance law forfixed-interval guidance time. In addition, this guidance law is generalized for intercepting anarbitrarily time-var...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Neurocomputing

سال: 2021

ISSN: ['0925-2312', '1872-8286']

DOI: https://doi.org/10.1016/j.neucom.2020.09.011